fix: default session keep_alive to 5 minutes by DaleSeo · Pull Request #780 · modelcontextprotocol/rust-sdk

DaleSeo · 2026-03-27T23:19:57Z

Motivation and Context

When an HTTP/2 connection silently drops (e.g., an Envoy sidecar sending RST_STREAM with NO_ERROR), the LocalSessionManager still holds the session handle, keeping the worker's event channel open. The session worker never detects the disconnect and runs indefinitely as a zombie. Over time these zombies accumulate and can block servers that iterate over sessions during notifications, eventually causing the server to become unresponsive.

This changes the default SessionConfig::keep_alive from None (infinite) to 5 minutes. After 5 minutes of inactivity, the session worker exits, the transport closes, and downstream servers can detect the peer as closed. Users can still set keep_alive: None to restore the old behavior if needed.
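The timeout behavior described above can be sketched with the standard library alone. This is a minimal illustration, not rmcp's implementation: `SketchSessionConfig`, `WorkerExit`, and `run_worker` are hypothetical names, and a `std::sync::mpsc` channel stands in for the session worker's event channel. Only the `keep_alive: Option<Duration>` field and its 5-minute default come from this PR.

```rust
use std::sync::mpsc::{Receiver, RecvTimeoutError};
use std::time::Duration;

/// Hypothetical stand-in for the session config. It mirrors only the
/// `keep_alive` field discussed in this PR.
#[derive(Debug, Clone, PartialEq)]
pub struct SketchSessionConfig {
    /// `None` means "wait forever" (the old default).
    pub keep_alive: Option<Duration>,
}

impl Default for SketchSessionConfig {
    fn default() -> Self {
        // New default from this PR: 5 minutes of inactivity.
        Self { keep_alive: Some(Duration::from_secs(300)) }
    }
}

/// Why the sketch worker stopped (hypothetical type).
#[derive(Debug, PartialEq)]
pub enum WorkerExit {
    /// Every sender was dropped, so the worker can see the disconnect.
    ChannelClosed,
    /// No event arrived within the keep-alive window.
    KeepAliveTimeout(Duration),
}

/// Handles events until the channel closes or the keep-alive window elapses.
/// Returns the number of events handled and the exit reason.
pub fn run_worker(
    events: Receiver<String>,
    config: &SketchSessionConfig,
) -> (usize, WorkerExit) {
    let mut handled = 0;
    loop {
        let next = match config.keep_alive {
            // With a limit, an idle channel eventually ends the worker.
            Some(limit) => events.recv_timeout(limit).map_err(|e| match e {
                RecvTimeoutError::Timeout => WorkerExit::KeepAliveTimeout(limit),
                RecvTimeoutError::Disconnected => WorkerExit::ChannelClosed,
            }),
            // Without a limit, a held-open sender blocks the worker forever:
            // this is the zombie case.
            None => events.recv().map_err(|_| WorkerExit::ChannelClosed),
        };
        match next {
            Ok(_event) => handled += 1,
            Err(exit) => return (handled, exit),
        }
    }
}
```

If a sender is kept alive but stays idle (the role the session manager's handle plays for a zombie), `run_worker` with `keep_alive: Some(..)` returns `WorkerExit::KeepAliveTimeout` once the window elapses. With `keep_alive: None` the same call would never return.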

How Has This Been Tested?

Verify the default is applied by checking the SessionConfig::default() value:

use rmcp::transport::streamable_http_server::session::local::SessionConfig;
let config = SessionConfig::default();
assert_eq!(config.keep_alive, Some(std::time::Duration::from_secs(300)));

To manually verify zombie cleanup, start an MCP server using LocalSessionManager::default(), initialize a session via curl, interrupt curl with Ctrl+C (leaving a zombie session behind), and wait 5 minutes. The server logs should show "keep alive timeout after 300000ms" and the session worker should exit.

Breaking Changes

None. No API signatures changed.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation update

Checklist

I have read the MCP Documentation
My code follows the repository's style guidelines
New and existing tests pass locally
I have added appropriate error handling
I have added or updated documentation as needed

Additional context

synapsis1 · 2026-04-13T20:18:53Z

Was the intention that it would log as ERROR? I am seeing:

ERROR rmcp::transport::worker: worker quit with fatal: keep alive timeout after 300000ms, when poll next session event

The good news is that this surfaced session worker zombies on my servers that I hadn't realized were there, since I had never overridden the old default of None. But isn't this more of a client error (the client left without saying goodbye)? Perhaps WARN, or even INFO, would fit better when the worker is terminated because the client dropped silently. ERROR generates questions, and this is normal behavior.

I didn't want to open up a new issue before understanding if this was the intent.

A secondary error also shows up, simply as a consequence of the first:

ERROR rmcp::transport::streamable_http_server::tower: Failed to close session <uuid>: Session error: Session service terminated

fix: default session keep_alive to 5 minutes

042f105

DaleSeo self-assigned this Mar 27, 2026

github-actions bot added the T-core (Core library changes) and T-transport (Transport layer changes) labels Mar 27, 2026

DaleSeo marked this pull request as ready for review March 31, 2026 10:56

DaleSeo requested a review from a team as a code owner March 31, 2026 10:56

alexhancock approved these changes Apr 8, 2026

alexhancock merged commit 929441e into main Apr 8, 2026
17 checks passed

alexhancock deleted the fix/default-session-keep-alive branch April 8, 2026 14:36

github-actions bot mentioned this pull request Apr 9, 2026

chore: release v1.4.0 #779

Merged
